Obstruent Consonant Landmark Detection in Thai Continuous Speech
Abstract
Obstruent consonants are key landmark events: their cues mark abrupt acoustic discontinuities in the speech signal. Such discontinuities allow further analysis and recognition to be performed in knowledge-based speech recognition systems. This paper describes an acoustic investigation of Thai obstruent consonant detection using average level crossing rate (ALCR) information. Simple and easy to compute, ALCR information alone has been used successfully in an automatic speech segmentation system for English, where performance comparable to, and in some cases slightly better than, spectral-domain methods based on Mel-frequency cepstral coefficients (MFCC) was reported. However, ALCR has never been applied to Thai. The objective of this study is therefore to apply ALCR information and ascertain its usefulness in detecting significant temporal changes involving obstruent consonants in Thai continuous speech. Preliminary results suggest that ALCR and RMS energy can be combined to detect the phonetic boundary between an initial obstruent consonant and the preceding or following vowel, or the final consonant of the preceding syllable. An experiment was conducted on a small speech corpus of 21 sentences designed to highlight the occurrences of all 21 possible leading consonants in various syllable structures. The overall detection rate is 83.5% for data from four speakers. The proposed method also reduces the insertion errors caused by amplitude variations within a phonetic segment.
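The abstract does not define how ALCR or RMS energy is computed, so the following is only an illustrative sketch under stated assumptions: the level crossing rate of a frame is taken as the fraction of consecutive sample pairs that straddle a given amplitude level, and ALCR is its mean over a set of levels. The function names, the levels, and the frame and hop sizes are all hypothetical and are not taken from the paper.

```python
import math

def level_crossing_rate(frame, level):
    """Fraction of consecutive sample pairs that straddle `level` (assumed definition)."""
    crossings = 0
    for prev, cur in zip(frame, frame[1:]):
        if (prev - level) * (cur - level) < 0:
            crossings += 1
    return crossings / max(len(frame) - 1, 1)

def alcr(frame, levels):
    """Average the level crossing rate over several amplitude levels."""
    return sum(level_crossing_rate(frame, lv) for lv in levels) / len(levels)

def rms(frame):
    """Root-mean-square energy of one frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))

def frame_features(signal, frame_len, hop, levels):
    """Return one (ALCR, RMS) pair per frame of the signal."""
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        feats.append((alcr(frame, levels), rms(frame)))
    return feats

# Toy signal: a slow, strong sinusoid (vowel-like) followed by a
# rapidly alternating, weak segment (fricative-like). Hypothetical data.
vowel_like = [math.sin(2 * math.pi * k / 40) for k in range(200)]
obstruent_like = [0.3 * (-1) ** k for k in range(200)]
levels = [-0.2, -0.1, 0.0, 0.1, 0.2]  # assumed amplitude levels
features = frame_features(vowel_like + obstruent_like, 100, 100, levels)
```

On this toy signal the vowel-like frames show a low ALCR with high RMS energy, and the obstruent-like frames show the reverse. That contrast is the kind of joint change a boundary detector combining the two measures could look for. How the study actually thresholds or combines them is not stated in the abstract.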
Similar resources
Consonant landmark detection for speech recognition
This thesis focuses on the detection of abrupt acoustic discontinuities in the speech signal, which constitute landmarks for consonant sounds. Because a large amount of phonetic information is concentrated near acoustic discontinuities, more focused speech analysis and recognition can be performed based on the landmarks. Three types of consonant landmarks are defined according to its characteri...
Landmark detection for distinctive feature-based speech recognition
This work is a component of a proposed knowledge-based speech recognition system which uses landmarks to guide the search for distinctive features. In the speech signal, landmarks identify times when the acoustic manifestations of the linguistically motivated distinctive features are most salient. This paper describes an algorithm for automatically detecting acoustically abrupt landmarks. Some ...
Perceptual Representation of Consonant Sounds in Thai
This work is an attempt to construct a perceptual representation of Thai consonants based on perceptual identification results (from 28 Thais) of 21 phonemes presented in noise. The experiment is designed to equally make pairwise comparisons among 21 word-initial phonemes, which results in 210 real-word stimulus pairs. Percent correct responses and confusion matrices are obtained. Similarity sc...
Contribution of consonant landmarks to speech recognition in simulated acoustic-electric hearing.
OBJECTIVES: The purpose of this study is to assess the contribution of information provided by obstruent consonants (e.g., stops and fricatives) to speech intelligibility in simulated acoustic-electric hearing. As a secondary objective, this study examines the performance of an objective measure that can potentially be used for predicting the intelligibility of vocoded speech. DESIGN: Noise-cor...